Posterior Simulation for Feed Forward Neural Network Models
نویسنده
چکیده
1 Motivation We are interested in Bayesian inference and prediction with feed-forward neural network models (FFNN's), speciically those with one hidden layer with M hidden nodes, p input nodes, 1 output node and logistic activation functions: we try to predict a variable y in terms of p variables x = (x 1 ; :::; x p), with regression function y(x) = P M i=1 j (j x) where (z) = exp(z) 1+exp(z). These and other neural network models are the central theme of recent research. Statistical introductions may be seen in Cheng and Titterington (1994) and Ripley (1993). Neural networks are typically presented as black box models to deal with nonlinear features in problems like regression, forecasting and classiication. Incorporating prior knowledge in those models should enhance their performance. This naturally begs for a Bayesian approach, see Buntine and Weigend for some views. Among other advantages, the Bayesian approach allows for coherent incorporation of all uncertainties, hence permitting coherent procedures to network architecture choice, one of the main problems in NN research. However, it leads to diicult computational problems, stemming from non-normality and multimodality of posterior distributions, which hinder the use of methods like Laplace integration, Gaussian quadrature and Monte Carlo importance sampling. Multimodality issues have predated discussions in neu-ral network research, see e.g. Ripley (1993), and are relevant as well for mixture models, see West, M uller and Escobar (1994) and Crawford (1994), of which FFNN's are a special case. There are three main reasons for multimodality of posterior models in FFNN's. The rst one is symmetries due to relabeling; we mitigate this problem introducing appropriate inequality constraints among parameters. The second, and most worrisome, is the inclusion of several copies of the same term, in our case, terms with the same vector. Node duplication may be
منابع مشابه
Neural Network Meta-Modeling of Steam Assisted Gravity Drainage Oil Recovery Processes
Production of highly viscous tar sand bitumen using Steam Assisted Gravity Drainage (SAGD) with a pair of horizontal wells has advantages over conventional steam flooding. This paper explores the use of Artificial Neural Networks (ANNs) as an alternative to the traditional SAGD simulation approach. Feed forward, multi-layered neural network meta-models are trained through the Back-...
متن کاملPrediction of the Effect of Polymer Membrane Composition in a Dry Air Humidification Process via Neural Network Modeling
Utilization of membrane humidifiers is one of the methods commonly used to humidify reactant gases in polymer electrolyte membrane fuel cells (PEMFC). In this study, polymeric porous membranes with different compositions were prepared to be used in a membrane humidifier module and were employed in a humidification test. Three different neural network models were developed to investigate several...
متن کاملSignal Prediction by Layered Feed - Forward Neural Network (RESEARCH NOTE).
In this paper a nonparametric neural network (NN) technique for prediction of future values of a signal based on its past history is presented. This approach bypasses modeling, identification, and parameter estimation phases that are required by conventional parametric techniques. A multi-layer feed forward NN is employed. It develops an internal model of the signal through a training operation...
متن کاملArtificial intelligence-based approaches for multi-station modelling of dissolve oxygen in river
ABSTRACT: In this study, adaptive neuro-fuzzy inference system, and feed forward neural network as two artificial intelligence-based models along with conventional multiple linear regression model were used to predict the multi-station modelling of dissolve oxygen concentration at the downstream of Mathura City in India. The data used are dissolved oxygen, pH, biological oxygen demand and water...
متن کاملFeedforward Neural Networks for Nonparametric Regression
Feed forward neural networks (FFNN) with an unconstrained random number of hidden neurons deene exible non-parametric regression models. In M uller and Rios Insua (1998) we have argued that variable architecture models with random size hidden layer signiicantly reduce posterior mul-timodality typical for posterior distributions in neural network models. In this chapter we review the model propo...
متن کاملModeling SMA actuated systems based on Bouc-Wen hysteresis model and feed-forward neural network
Despite the fact that shape-memory alloy (SMA) has several mechanical advantages as it continues being used as an actuator in engineering applications, using it still remains as a challenge since it shows both non-linear and hysteretic behavior. To improve the efficiency of SMA application, it is required to do research not only on modeling it, but also on control hysteresis behavior of these m...
متن کامل